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PROBE 

The present invention relates to DNA probes for 
detecting a tandemly-repeated nucleotide sequence in the 
gene encoding mucin glycoprotein expressed by human 
mammary epithelial cells, to the use of the probe in 
diagnosis and in "fingerprinting" individuals, to the 
polypeptides expressed by the corresponding mucin gene, to 
antibodies against the polypeptides and to the use of the 
polypeptides and antibodies in the diagnosis and 
therapeutic treatment of cancer. 

Normal and malignant human mammary epithelial 
cells express high molecular weight glycoproteins (gps) 
which are extensively glycosylated and very antigenic. As 
a result, many of the monoclonal antibodies (MAbs) selected 
for reactivity with human breast cancer and other 
carcinomas are found to react with molecules which are 
produced in abundance by the fully differentiated human 
mammary tissue and are found in the milk fat globule (MFG) 
and in milk. However, the level of expression of a 
particular antigenic determinant may be different in the 
gps produced by the normal differentiated cell and in the 
similar molecules produced by breast cancers. This means 
that some antibodies can show a certain specificity for 
reacting with tumour gps. 
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The molecules bearing the epitopes recognised by 
these antibodies are complex and have been difficult to 
analyse,, both because they are large and heavily 
glycosylated (>25G,Q00 daltons) and because of the complex 
pattern of expression. Two of the MAbs, HMFG-1 and -2, 
react with a component in human milk which appears to be 
greater than 400,000 daltons, whereas the molecules found 
in sera and tumours are smaller, although the dominant 
components are still greater than 200,000 daltons on 
immunoblots. The large glycoprotein produced by the 
differentiated mammary epithelial cells found in human milk 
or in the milk fat globule has been purified and shown to 
have some of the characteristics of the mucins. This 
component ' contains a large amount of carbohydrate joined in 
O-linkage to serine and threonine residues via the linkage 
sugar N-acetylgalactosamine. Moreover, the core protein 
contains high levels of serine, threonine and proline and 
low levels of aromatic and sulphur containing amino acids • 

These mucin-like glycoproteins are also secreted 
by a number of other. normal epithelial cells. The 
monoclonal antibody HMFG-1 is highly reactive with the milk 
mucin and evidence suggests that the epitope recognised by 
this antibody is more abundant on the fully processed 
mucin, characteristic of normal differentiation. 

In tumours, the molecular weight of the molecules 
carrying these antigenic determinants differs among 
individual tumours and, in the case of the components 
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recognised by the HMFG-2 antibody, can range from 80-400K 
daltons. Although it appears that the differences observed 
.in the mobility of the high molecular weight bands are due 
to genetic polymorphism this probably does not explain 
variations in the size of the lower bands. It has been 
proposed that these may be the result of aberrant 
processing occurring in the tumour cell possibly within the 
glycosylation pathways. 

For the majority of the monoclonal antibodies 
reacting with this group of molecules the exact nature of 
the antigenic epitopes remains unclear but circumstantial 
evidence has suggested that carbohydrate may at least be 
partly involved in many of the epitopes. Moreover, from 
previously available data it was not known whether the 
mucin found in the normal differentiated cells, and that 
observed in the tumours, contain the same core protein, or 
just carry common carbohydrate determinants. 

Mucin has now been isolated from human milk by 
affinity chromatography enabling identification of the core 
protein and the gene encoding the protein. This has been 
found to be a highly polymorphic gene defined by the peanut 
urinary mucin (PUM) locus [see Swallow et al. , Disease 
Markers , 247, (1986) and Nature , 327 , 82-84 (1987)]. 
The gene product, which is hereafter referred to as human 
polymorphic epithelial mucin or HPEM, has been detected in 
breast tumours and other carcinomas as well as in some 
normal epithelial tissues. 
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It has now been found that the HPEM core protein 
has epitopes which also appear in the aberrantly processed 
gps produced by adenocarcinoma cells. Certain of these 
epitopes are not exposed in the fully processed mucin, 
glycoprotein produced by the lactating mammary gland. 

In one aspect the present invention therefore 
provides an antibody against a human mucin core protein 
which antibody substantially does not react with a fully 
processed human mucin glycoprotein. 

As used herein the term "antibody" is intended to - 
include fragments of antibodies bearing antigen binding 
sites such as the F(ab') 2 fragments. 

Antibodies according to. the present invention 
react with HPEM core protein, especially as expressed by 
colon, lung, ovary and particularly breast carcinomas, but 
have reduced or no reaction with the corresponding fully 
processed HPEM. In a particular aspect the antibodies 
react with HPEM core protein but not with fully processed 
HPEM glycoprotein as produced by the normal lactating human 
mammary gland. 

Antibodies according to the present invention 
have no significant reaction with the mucin glycoproteins 
produced by pregnant or lactating mammary epithelial 
tissues but react with the mucin proteins expressed by 
mammary epithelial adenocarcinoma cells. These antibodies 
show a much reduced reaction with benign breast tumours and 
are therefore useful in the diagnosis and localisation of 
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breast cancer as well as in therapeutic methods. 

The antibodies may be used for other purposes 
including screening cell cultures for the polypeptide 
expression product of the human mammary epithelial mucin 
gene r or fragments thereof, particularly the nascent 
expression product. In this case the antibodies may 
conveniently be polyclonal or monoclonal antibodies. 

Antibodies according to the present invention may 
be produced by innoculation of suitable animals with HPEM 
core protein or a fragment thereof such as the peptides 
described below. Monoclonal antibodies are produced by the 
method of Kohler & Milstein (Nature 256, 495-497/1975) by 
immortalising spleen cells from an animal innoculated with 
the mucin core protein or a fragment thereof, usually by 
fusion with an immortal cell line (preferably a myeloma 
cell line) , of the same or a different species as the 
innoculated animal, followed by the appropriate cloning and 
screening steps. 

In a particular aspect the present invention 
provides the monoclonal antibodies designated SM3 against 
the HPEM core -protein. In another aspect the invention 
provides the hybridoma cell line which secretes the 
antibodies SM3 and has been designated HSM3. Samples of 
HSM3 have been deposited with ECACC on 7th January 1987 
under accession number 87010701. 

Using antibodies according to the invention it 
has been possible to screen a phage library constructed 
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from mRNA isolated from a human breast cancer cell line to 
identify sequences coding for portions of the mucin core 
protein. Complementary DNA sequences have been constructed 
and from these it has surprisingly been found that the gene 
encoding the core protein contains multiple tandem repeats 
of a 60 base sequence leading to considerable polymorphism 
sufficiently extensive that cDNA fragments corresponding to 
the repeat sequence would be useful for fingerprinting DNA. 
The fingerprinting thus made possible has applications in 
for instance ascertaining whether bone marrow growth after 
transplants is from the host or the donor and in forensic 
medicine for identifying individuals using body . tissues or 
fluids. 

Accordingly the present invention also provides a 
nucleic acid fragment comprising at least 17 nucleotide 
bases the fragment being hybridisable with at least one of 

a) the DNA sequence 

5» * 

ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC- 

* 

ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

3' 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC 

b) DNA complementary to the DNA of a) f i.e. of sequence 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
* 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT- 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
* 3 1 
CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT 
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c) RNA having a sequence corresponding to the DNA sequence 
of a) and 

d) RNA having a sequence corresponding to the complementary 

DNA sequence of b) . 

The sequences in (a) and (b) each include a double 
tandem repeat sequence of 120 bases. Fragments according to 
the invention may correspond to any portion of this sequence 
including portions bridging the start point of the repeat. 

Fragments according to the invention will 
hybridise under conditions of low stringency with the DNA 
and RNA sequences (a) to (d) above. Preferred fragments are 
those which also hybridise under conditions of high 
stringency. The most preferred fragments of the invention 
are those which have sequences exactly identical to, or 
exactly complementary to the sequences (a) to (d) above. 

Normally a given DNA or RNA fragment according to 
the invention will be capable of hybridising with both DNA 
according to a) and RNA according to c) or with both DNA 
according to b) and RNA according to d) above. 

Preferably the nucleic acid fragment according to 
the present invention will comprise a portion of at least 30 
nucleotide bases capable of hybridising with at least one of 

a) to d) above, more preferably at least 50 such bases and 
most preferably the fragment contains a sequence of 60 bases 
exactly complementary to one of the repeat sequences of a) , 

b) c) or d) above. Other fragments of the invention may 
comprise two or more repeats of such a sequence , optionally 
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with minor, variations by way. of substitution. Preferably 
such fragments include an integral number of such repeat 
sequences. Further fragments of the invention comprise the 
tandem repeat sequence and additional coding or non-coding 
5* and/or 3' flanking sequences corresponding to the HPEM 
gene or a portion thereof. 

When the existence of a tandem repeat sequence was 
first identified it was believed that the sequence consisted 
of 59 base pairs corresponding with the sequences indicated 
in (a) and (b) above except for the lack of the base 
indicated with "* n . 

Many fragments according to the invention as 
originally defined in British Patent Application No. 8700269 
also conform with the new definition of fragments as set out 
herein and those fragments of sequences defined under (a) , 
(b) , (c) or (d) above which do not include the bases marked 
n * M form a particular aspect of the present invention. Such 
fragments have sequences corresponding to at least a portion 
of the sequences 

a 1 ) GTG GGC TGG GGG GGC GGT GGA GCC 

a' ') CGG GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC AC 

b f ) DNA complementary to the sequence of a') or a 11 )* 

c 1 } RNA having a sequence corresponding to the sequence of 

a f ) or a' ' ) and 
d f ) RNA haying a sequence corresponding to one of the 

complementary DNA sequences of b 1 ) 

In the human genome, the DNA tandem repeat sequence 



WO 88/05054 



PCT/GB88/00011 



-9- 

comprises antiparallel double stranded DNA, one strand 
having sequence (a) and being paired with a strand having 
sequence (b) . 

As mentioned above the nucleic acid fragments of 
the invention may be used as a probe for detecting one or 
other strand of the DNA tandem repeat sequence in the human 
genome, or RNA transcribed from either strand and hence for 
identifying the gene or genes for human mucin core proteins, 
mRNA transcribed therefrom and complementary DNA and RNA. 
For such purposes it may be convenient to use the complete 
normal gene comprising at least one tandem repeat sequence, 
or mRNA transcribed therefrom or to attach non-complementary 
fragments to either or both the 5 f and 3 f ends of a fragment 
according to the invention and/or to attach detectable 
labels (such as radioisotopes, fluorescent or enzyme labels) 
to the probe or to bind the probe to a solid support. All 
of these may be achieved by conventional methods and the 
nucleic acid fragments of the invention may be produced de 
novo by conventional nucleic acid synthesis techniques. 

The nucleic acid fragments of the present 
invention may also be used in active immunisation 
techniques. In such methods the fragment codes for a 
polypeptide chain substantially identical to a portion of 
the mucin core protein and may be extended at either or both 
the 5' and 3' ends with further coding or non-coding nucleic 
acid sequences including regulatory and promoter sequences, 
marker sequences and splicing or ligating sites. Coding 
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sequences may code for corresponding portions of the mucin 
core protein chain or for other polypeptide chains. The 
fragment according to the invention/ together with any 
necessary or desirable flanking sequences is inserted, in an 
appropriate open reading frame register, into a suitable 
vector, such . as a plasmid or a viral genome (for instance 
vaccinia virus genome) and is then expressed as a 
polypeptide product by conventional techniques. In one 
aspect the polypeptide product may be produced by culturing 
appropriate cells transformed with a vector, harvested and 
used as an immunogen to induce, active immunity against the 
mucin core protein.. In another aspect the vector, 
particularly in the form of a virus, may be directly 
innoculated into a human or animal to.be immunised. The 
vector then directs expression of the polypeptide in vivo 
and this in turn serves as an immunogen to induce active 
immunity against. the mucin core protein. 

The invention therefore provides nucleic acid 
fragments as hereinbefore defined for use in methods of 
treatment of the human or animal body by surgery or therapy 
and in diagnostic methods practised on the human or animal 
body. The invention also provides such methods for 
treatment of the human or animal body by surgery or therapy 
and diagnostic methods practised in vivo as well as ejc vivo 
and in- vitro . 

The invention further provides a polypeptide 
comprising a series of residues encoded by the DNA tandem 
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repeat sequence, the sequence shown at (b) above being the 
coding sequence. Polypeptides according to the invention 
are selected from any of those having 5 or more amino acid 
residues represented by the following amino acid sequence 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr 
Ala Pro Pro Ala His Gly*Val Thr Ser Ala Pro Asp Thr Arg 
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

marks the start of the repeat in the peptide) . 

Polypeptides according to the invention may have 
a sequence corresponding with any portion of the 40 
residue sequence above and may include the start point of 
the repeat sequence. 

Other polypeptides according to the invention 
include three or more repeats of the 20 amino acid repeat 
sequence. Such polypeptides may include minor variations 
by way of substitution of individual amino acid residues. 

The invention further provides polypeptides as 
defined above modified by addition of N-acetyl 
galactosamine (a linkage sugar) on serine and/or threonine 
residues and by addition of oligosaccharide moieties to 
that or via other linkage sugars and/or fragments linked 
to carrier proteins such as keyhole limpet haemocyanin r 
albumen or thyroglobulin. 

Preferably the polypeptide comprises at least 10 
amino acid residues of the sequence above, more preferably 
20 residues. The polypeptide may comprise the full 
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sequence above. Such polypeptides may further comprise 
additional amino acid. residues, preferably conforming to 
the amino acid sequence of HPEM core protein. 

In a further aspect the present invention 
provides the HPEM core protein. This is encoded by the 
PUM gene and may be produced by recombinant DNA techniques 
and expressed without glycosylation in human or non-human 
cells. Alternatively it may be obtained by stripping 
carbohydrate from native human mucin glycoprotein which 
itself may be produced by isolation from samples of human 
tissue or body fluids or by expression and full processing 
in a human cell line. The HPEM core protein may be used 
for raising antibodies in animals for use in passive 
immunisation , diagnostic tests and tumour localisation and 
in active immunisation of humans. 

The invention further provides antibodies 
(monoclonal or polyclonal) , and fragments thereof, against 
any of the polypeptides described above. Such antibodies 
may be obtained by conventional methods and are useful in 
diagnostic arid therapeutic applications. 

The invention further provides antibodies 
(monoclonal or polyclonal ), or fragments thereof, linked 
to therapeutically or diagnostically effective ligands. 
For therapeutic use of the antibodies the ligands are 
lethal agents to be delivered to cancerous breast or other 
tissue in order to incapacitate or kill transformed cells. 
Lethal agents include toxins, radioisotopes and 'direct 
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killing agents' such as components of complement as well 
' as cytotoxic or other drugs. Further therapeutic uses of 
the antibodies inclusive passive immunisation. 

The invention further provides therapeutic 
methods comprising the administration of effective 
non-toxic amounts of such antibodies or fragments thereof 
and antibodies or fragments thereof for use in therapeutic 
treatment of the human or animal body. Especially in 
therapeutic applications it may be appropriate to modify 
the antibody by coupling the Fab region thereof to the Fc 
region of antibodies derived from the species to be 
treated (e.g. such that the Fab region of mouse monoclonal 
antibodies may be administered with a human Fc region to 
avoid immune response by a human patient) or in order to 
vary the isotype of the antibody. 

In the diagnostic field the antibodies may be 
linked to ligands such as solid supports and detectable 
labels such as enzyme labels, chromophores and 
fluorophores as well as radioisotopes and other directly 
or indirectly detectable labels. Preferably monoclonal 
antibodies or fragments thereof are used in diagnosis. 

The invention further provides a diagnostic 
assay method comprising contacting a sample suspected to 
contain abnormal human mucin glycoproteins with an 
antibody as defined above. Such methods include tumour 
localisation involving administration to the patient of 
the antibody or fragment thereof bearing a detectable 
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label or of an antibody or fragment thereof and, 
separately simultaneously or sequentially in either order 
a labelling entity capable of selectively binding the 
antibody or fragment thereof. The invention also provides 
antibodies or fragments thereof for use in diagnostic 
methods practised on the human or animal body* 

Particular uses of the antibodies include 
diagnostic assays for detecting and/or assessing the 
severity of breast, ovary and lung cancers. 

Diagnostic test kits are provided for use in 
diagnostic assays and comprise antibody or a fragment 
thereof, optionally suitable labels and other reagents 
and, especially for use in competitive assays, standard 
sera. 

The invention will now be illustrated by the 
following Examples and with reference to the figures of 
the accompanying drawings in which 
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figure LegendB 

figure 1: Purification of the Bilk mucin by imaunoa^f inity 
chromatography using the antibody »0>G-1. Kilks from aereral 
Indiriduala were combined and aba orbed to a BffG-l-Sepharoae column as 
described in Methods. The material alutin* at low pH was iodinated 
and subjected to PAGE electrophoresis and auto radiography (track 1). 
The iodinated material was precipitated using the Protein A method 
with antibodies Hl£PG-1 (track 5), (track 2), ET254 (track 3) 

and RP1£I 4 20% FCS (track 4) . 

figure 2 : Comparison of the 125 I labelled purified milk mucin with 
ir=unoblot of human skimmed milk. A, human akimmed milk was subjected 
to SDS polyacrylaaide electrophoresis, transferred to nitrocellulose 
paper, the blot probed with the monoclonal antibody H2£PG-1 and binding 
detected using an ZLISA method, B, after purification on an HlffO-1 
affinity column followed by G75 Saphadex chromatography the milk mucin 
was iodinated by the Bolton and Hunter method and subjected to SDS 
polyacrylamide electrophoresis and autoradiography. 

Figure 5: Autoradiography of the iodinated milk mucin after treatment 
with hydrogen fluoride. The purified milk mucin was treated with HP 
for 5 hours at room temperature (traek 1) or 1 hour at 4*C (track 2) 
and the resulting preparations were then iodinated and run on SDS 
polyacrylamide gala. 
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?^ are 1 ; ^activity of the intact, partially Gripped or extensively 
■tripped nilk mucin with iodinated lectins. The purified intact milk 
auein (track 1), the mucin treated with HF for 1 hour at 4*C (track 2) 
and the mucin treated for S boure at room temperature (track 5) were 
■objected to SDS polyaerylamide electrophoresis and then transferred 
to nitrocellulose paper. The paper was then probed with 125 I PKA 
(peanut agglutinin), 125 I WGA (wheat ( .m agglutinin), or 125 I HPA 
(Helix pomatia agglutinin) . 

Pi pure 5; Taauneprecipitation end issunohlots of the partially and 
extensively stripped mucin, a, the 125 I extensively. stripped mucin 
was icsunoprecipiteted with SU-Z (track 5), KKFG-2 (track 2) or KS2 
medius as a control (track 1) by the protein A plate aethod (see 
Materials and Methods). B. the partially stripped aucin (track 1) or 
extensively stripped mucin' (track 2) was run on SDS polyaerylamide 
gels and transferred to nitrocellulose paper. The blot was then 
reacted with a cocktail of SH-S and SM-4 monoclonal antibodies and the 
binding detected using an ELISA oethod. 

Zi£2IS_§: Reactivity of monoclonal antibodies SM-3 and HKFG-2 with 
aethaearn fixed breast tissue and tumour sections using an indirect 
iaaunoperoxidase ataining method. Infiltrating ductal carcinoma 
■bowing atrong reactivity with both SM-3 (A) and KHPG-2 (B) 
fibroadenoma ■bowing no reactivity with SM-3 (C) and atrong 
heterogeneous .taining of the epithelium with HMPG-2 (D). Papilloma 
■bowing wery weak reactivity with SM-S (2) and strong po«itivity with 
HMFG-2 (P). Both normal resting breast (G) and lactating breast (I) 
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positively with HU7G-2 with laetating breast (J) snieh str nger than 
aonaal resting breast (H). 
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Figure Legends 

Figure 7, Periodic acid-silver stained milk mucin after antibody 
affinity column and gel filtration column. Milk mucin was purified on ■ 
an HMFG-1 antibody affinity column (lane 1 ) followed by passage 
through a G75 Sephadex column (lane 2), subjected to 
HaDodSO^/polyacrylamide gel electrophoresis , and silver stained 
following treatment of gels with 0.2% periodic acid. 

Figure r . Silver stain of partially and totally stripped core 
protein from milk mucin. The purified milk mucin was deglycosylated 
by treatment with anhydrous hydrogen fluoride for 1 hr at 0 # C (lane 1) 
and S hr at room temperature (lane 2), separated by electrophoresis 
through a NaDodSO^/polyacryl amide gel (10%) and silver stained. 

Figure q „ Immunoprecipitation with EAbs of in vitro translated 
protein products from MCF-7 poly(A) + RNA. Poly(A} + RNA from MCF-7 
cells was translated in a rabbit reticulocyte lysate system (Amersham) 
in the presence of [ 35 S] methionine (1000 Ci/mmole; 1Ci = 37 GBq) 
following the manufacturer 1 s conditions. Samples containing 5 x 10* 
acid precipitable cpm were precipitated with KAbs SM-4 (lane a) , SU-3 
(lane b), HMFG-2 (lane c), HMFG-1 (lane d) and an irrelevant UAb to 
interferon (lane e, 24), separated on a HaDodSO^/polyacrylamide gel 
(10%), impregnated with Amplify and exposed to XAR-5 film at -70 9 C for 
20 days . 
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Figure 10 , Immunoblot analysis of fusion proteins from the Xmuc 
clones. The phage clones XMUC 3,4,6,7,8,9 and 10 were used to 
lysogenize bacterial strain Y 1089. Imogens were grown at 32 *C, 
shifted to 42 *C, and then induced with XPTG. Lysogen proteins were 
fractionated by electrophoresis through a NaDodSO^/polyacrylamide gel 
(7.5%), transferred to nitrocellulose, and reached with Hl£FG-2. The 
binding was detected with an ELISA method using 4-chloro-1 -naphthol as 
the substrate. The numbers are those of the X clones. 

Figure 11 . -Hybridization of plfUCIO to cDNA inserts of pMUC clones. 
DNA from the plasmid clones was digested with restriction enzyme EcoRI 
to excise the cDNA inserts, separated by electrophoresis on 1.4% 
agarose and transferred to Biodyne nylon membrane. The filter was 
hybridized using standard conditions (34) to the insert from plfUCIO 
which was labelled with [a- 52 p]dCTP by the method of random priming 
(41). Lanes: plasmid clones 3,4,6,7,8,9,10. 

Figure 12 . RNA blot hybridization analysis of mammary breast mucin 
mRNA. 10 jig of total RNA from human breast cancer cells UCF-7 (lane 
1 ) and T47D (lane 2) , normal human mammary epithelial cells HuME (lane 
5), human embryonic fibroblasts ICH? 23 (lane 4), Daudi cells (lane 5) 
and carcinosarcoma HS578T cellB (lane 6) were separated in a 1.5% 
agarose/glyoxal gel, blotted on to nitrocellulose and hybridized to 
the plfUCIO EcoRI insert which was labelled with [a- 52 p]dCTP by the 
method of random priming (41). The size markers were 28S (5.4 kb) and 
18S (2.1kb) rRNAs • 
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Polymorphic human DNA fragments detected by 

hybridisation with pUUCIO probe. Genomic DNA samples prepared from 

the white blood cells from ten individuals (six unrelated) and from 

three cell lines were digested to completion with Hinfl and EcoRI, 

fractionated by electrphoresis through 0.7% and 0.6% agarose, 

respectively, and transferred to Biodyne nylon membranes. The filter 

was hybridized to the pliDCIO DNA insert which was labelled with 
52 

[a p]dCTP by the method of random priming (41). X-ray film was 
exposed for 1 day at -70*C with intensifying screens. Lanes 1-4 
father, two daughters and mother, lanes 5-10 unrelated individuals, 
lane 11 is MCF-7, lane 12 is ZR75-1, lane 13 is ICRF-23. The DNA 
samples exhibit a wide distribution of sizes. Numbers indicate length 
of DNA in kb. The apparent bands at 23S> are in lanes 12 and 13 are 
artefacts introduced in autoradiography 
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Example 1 

Purification of the milk mucin 

The ailk mucin was purified from human skimmed milk by passage 
through an HkCFG-1 affinity column followed by site exclusion 
chromatography. 3he EK7G-1 monoclonal antibody was purified from 
tissue culture ■upernatant using a protein A column (1). The purified 
antibody was coupled to cyanogen bromide activated sepharo&e 
(Pharmacia) as described in the manufacturer's instructions . Human 
skimmed milk was passed in batches of 100 ml through the antibody 
column followed by extensive washing with PBS. Bound antigen was 
eluted from the column using 0.1 K glycine pH 2.5 and the fractions 
registering an optical density at 260na were pooled, dialysed against 
0.25 U acetic acid and lyophilized. Batches of about 20 mgs were 
dissolved in 0.25 U acetic acid and passed through a G75 Sephadex 
column (1 x 100 cm) which had been previously equiligrated with acetic 
acid. The column was washed with 0.25 V acetic acid and 1ml fractions 
collected. The peak fractions which were eluted in the void volume 
were pooled, lyophilixed and the dry powdor stored at 4 # C. Amino acid 
analysis was performed using a Beckman 6300 amino acid analyser. 

Deglvcosylation of the milk mucin 

To remove the 0-linked carbohydrate from the silk mucin the 
molecule was treated with anhydrous hydrogen fluoride as described by 
Kort and Lamport f21) $ for either 1 hour at 4*C which produced a 
partially stripped preparation, or S hours at room temperature which 
produced the extensively stripped mucin. 
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Iodination of the milk mucin 

lodinations of the purified mucin, the partially or extensively 

■tripped mucin were carried out using the Bolton and Hunter method 

(51). Briefly, the mucin, 2.5 ug in 20 jd 0.111 borate buffer pH 8.5, 
was added to the dried Bolton and Hunter reagent (t mCi, Amerahan 

International pic) and incubated at room temperature for 15 minutes. 
The reaction was stopped by the addition of 0.5 ml of 0.2M glycine in 
borate buffer and after a further 15 minutes incubation, free Bolton 
and Hunter reagent was removed by passage through a G25 Sephadex 
column (FD10 columns, Pharmacia) previously equilibriated in PBS. 

Iodination of Lectins 

Wheat germ agglutinin (WGA) , peanut agglutinin (PNA) (Vector Labs) 
and Helix pomatia agglutinin (HPA) (Boehringer) were iodinated as 
described by Karlsson at al. (52) using the chloramine T method. 

Polvacrvlamide gels and Western blots 

Poly acryl amide gel electrophoresis and immunoblotting was 
performed as described previously (1). Briefly, samples were run on 
5-1 5% polyacrylamide gels and then electrophoretically transferred to 
nitrocellulose paper (Schleicher and Schuell) at 50 volts overnight 
*t 4*C (36). In the immunoblotting experiments the paper was reacted 
with monoclonal antibodies and binding detected with an ZLISA method 
main* 4-chloro-1 -naphthol as the substrate. For lectin binding 
studies the Western blots were reacted with the iodinated lectins as 
described by Swallow et al. 68). 
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Production of monoclonal antibodies 

A female BALB/c mouse was imaunixed with 5 pg of the partially 
stripped milk mucin in Preund's complete adjuvant and S months later 
boosted with a further 5 pg of the same preparation in Freund f s 
incomplete adjuvant. After a further 20 days, 5 pg of the mucin 
extensively stripped of its carbohydrate was given intravenously in 
•aline solution. the spleen was removed 4 days later, and fused with 
the HS2 mouse myeloma eell line (53). 

Screening of hybridoma supernatant and innunoprecipitations 

The screening assay was a modification* of that described by Melero 
and Gonzalez-Rodriguez (54^. Ifultiwell plates were coated with 50 pi 
of 0.1 mg/ml protein A (Pharmacia fine Chemicals) in PBS and allowed 
to dry overnight at 37 *C. The plates were blocked with S% BSA for 1 
hour at S7*C followed by the addition of 50 pi of rabbit anti-mouse 
immunoglobulin (DAXO, diluted 1:10 in PBS/BSA = PBS/BSA). After 
incubating for 2 hours at S7 # C the plates were washed twice with PBS 
containing 1% BSA and 50 pi of hybridoma supernatant added, The 
plates were incubated overnight at 4*C, washed twice with PBS/BSA and 
50 pi of iodinated partially stripped mucin containing 100,000 cpm 
added to each well, Ike plates were then incubated at room 
temperature for 4 hours, washed 4 times with PBS/BSA and the 
individual wells counted in a gamma counter, For immunoprecipitation 
experiments 50 pi of SDS sample buffer containing dithiothreitol was 
added to sach of the wells which were then boiled for 5 minutes and 
the buffer loaded onto 5-15% polyacryl amide gradient gels. 
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StaininE of tissue sectiong 

Tissues from- primary mammary carcinomas, benign breast biopsies, 
normal breast, and pregnant lactating breast tissue were fixed in 
methacara (methanol chloroform and acetic acid 60:30:10) and embedded 
into paraffin wax. Sections were stained with the antibodies using 
the indirect peroxidase anti peroxidase method as previously described 
(47). 

Results 

Purification of the milk mucin 

The milk mucin was purified from human skimmed milk on an Hi£FG-1 
antibody affinity column. Iodination of the eluted material revealed 
the presence of a large molecular weight component and a 68KD band. 
Precipitation of the affinity purified material with antibodies HHFG-1 
and HMFG-2 (tracks 2 and 5) followed by gel electrophoresis showed 
that both the high molecular weight components and the 68ZD component 
were precipitated by both antibodies (less effectively by EUFG-2). 
Since the 68ID component was also precipitated by two unrelated 
antibodies (figure 1, tracks 5 and 4) and this component was not 
evident on an immunoblot of the purified material reacted with HKFG-1 
(figure 2A) f the 6BK component was removed by molecular sieve 
chromatography on a 075 column. The final purified product showed a 
major high molecular weight band, with only a trace of the 6BL 
component and a minor contaminant around 1 4X (figure 2B) % 
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A high molecular weight glycoprotein (PAS-0) containing more than 
50% carbohydrate in 0-linkage has been purified from the human milk 
fat globule by Shimixu and lamauchi (8). To see whether thiB 
component was similar to the mucin isolated from milk by affinity 
chromatography on an affinity column, the amino acid 

compoaition of the purified HKFG-1 reactive mucin was determined and 
compared to the amino acid composition of the purified PAS-0 
component. Table 1 shows that there is good correspondence between 
the two sets of data, indicating that the core proteins of PAS-0 and 
the mucin purified here are the same. 

Isolation of the core protein of the milk mucin 

As there are no enzymes easily available that are efficient at 
removing 0-linked sugars, and p elimination often results in damage to 
the protein core, the oligosaccharides were removed by treatment of 
the mucin with anhydrous hydrogen fluoride. This treatment haB been 
shown by Mort and Lamport (211 to be effective in removing sugars from 
pig submaxillary mucin without damaging the protein core. Amino acid 
analysis of the material produced after HP treatment of the milk mucin 
auggested that the protein core was also in this case undamaged, since 
the composition was the same as that seen in the intact mucin (Table 
1)- 

Initially the milk mucin was exposed to HP for only 1 hour at 4 # , 
but analysis of the product showed that there was only partial removal 
of the sugars with such treatment, and it was necessary to treat the 
mucin at room temperature for 5 hours to obtain a molecule which 
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•shoved no lectin binding ability. Figure 3 shows an autoradiograph of 
the iodinated products after treatment for 1 hour at 4* (track 2) or 3 
hours at KT (track 1). It can be seen from figure 3 that the milder 
treatment results in a mixture of products made up of high molecular 
weight material which is slightly smaller than the intact mucin and a 
number of smaller bands. After longer exposure to EF at room 
temperature, the high molecular weight bands disappeared resulting in 
polypeptide bands of about 6 BID and 72ED. 

To test for the presence of sugars on the intact mucin and on the 
products produced after the two different KF treatments each 
preparation was subjected to acrylaside gel electrophoresis , 
transferred to nitrocellulose paper and reacted vifh "I-labelled 
lectins* The lectins used were peanut lectin (PNA) vfcich reacts with 
galactose linked to X-acetyl galactosaaine c wheat germ (WGA) reactive 
with X-acetyl glucosamine and Helix pomatia agglutinin (HPA) which 
reacts with the linkage sugar in 0-linked glycoeylstion, 
K-acetylgalactoeamine. Figure 4 shows autoradiographs of the reacted 
blots, and.it can be seen that while treatment wilt K? for 1 hr at 4" 
(track 2) alters the lectin reactivity of the mucin, carbohydrate is 
still present. Interestingly, however, there is a much lower level of 
binding of PNA to the high molecular weight material of the partially 
stripped mucin than is seen with the intact mucin (track 1). 
Moreover, this loss in PKA binding ability is accompanied by binding 
of the linkage sugar specific lectin HPA. This lectin shows no 
binding at all to the intact mucin, and the changed pattern of lectin 
binding after limited treatment with HP indicates that sugars masking 
the 0-linked N -acetylgalactosamine have been stripped off. The 
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•mailer component Been In both the intact mucin (track 1) and in the 
partially stripped preparation (track 2) is a glycoprotein which 
reacts with VGA, although not with FNA. This may correspond to the 
component of similar molecular weight (around 6BI) seen after affinity 
chromatography of the cucin and may represent an intermediate 
precursor molecule. 

figure 4 shows dearly that the 681 and 721 components produced 
after extensive treatment with H? (5 fcr at RT) , show no reactivity 
with the lectins (track 5), including the *' -a.ee tylgalaetOB amine 
specific lectin KPA. This observation constitutes strong evidence 
that the sugars have been recovedfrom at le&Bt the majority of the 
molecules, and we will refer to this preparation as the extensively 
stripped cucin. 

Generation of monoclonal antibodies to the milk mucin core protein 

A fusion was carried out using the spleen of a mouse that had been 
immunized with two injections of the partially stripped milk mucin 
follo»9d by a boost with th6 extensively stripped mucin. The clones 
were initially screened against the 125 I partially stripped material 
using protein A plates (see Methods). ?our hybridomaa were selected 
and cloned, and table 2 shows their spectrum of reactivity with the 
intact, partially and extensively stripped mucin. As can be seen from 
this table three of the hybridomaa which were isolated showed a strong 
reaction with the partially and extensively stripped mucin and did not 
react with the intact mucin. These appeared to be good candidates for 
monoclonal antibodies to the protein core and two, SU-3 and SM-4, were 
selected to be characterised further. 
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It can also be. seen from table 2 that the HUPG-1 and H£FG-2 
antibodies reacted very strongly with the mucin .tripped of its ' 
carbohydrate. These two antibodies were, in fact, developed using the 
intact mucin (from the milk fat globule) as immunogen and. in the ease 
of HIFG-2, growing auaaary epithelial cells (U>. Iheir reaction with 
the atripped srucin was unexpected, as circumstantial evidence had 
previously led to the belief that carbohydrate might form at least 
part of their antigenic epitopes. 

Molecular Weight of molecules earrv^ anti^nlr. determinants 

3he antibody SK-3 was shown to be of the IgGl aubclaas. while the 
SU-4 antibody was found to be Igll. We therefore chose to use the SU-3 
antibody in subsequent experiments aince antibodies of the IgM class 
can present problems in some appliction. Immunoprecipitation of the 
extensively stripped material with SM-3 ahowed a reaction with the 
lectin unreactive 66X component (Figure 5A, track 3). The monoclonal 
antibody HMFG-2 can also be seen to immune precipitate the 
lectin-unreactive 68Z component (track 2). The antibodies were 
reactive with antigen on immunoblots and Figure SB shows the reaction 
of antibody SU-S with the dominant 68Z band of the extensively 
stripped mucin (track 2) . 

We have previously shown that the molecular weight of the 
components in breast cancer cells carrying determinants found on the 
milk mucin is lower than 400* and can wary from one tumour to another 
(1). Reaction of antibody SU-S with Western blots of gel separated 
extracts of breast tumour cells shows that this antibody reacts with 
components of similar molecular weight to thoee reactive with antibody 
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HHPG-2 (data not shown). Because the antibody SM-3 differs from the 
antibodies &E7G-1 and 2 in that it does not react with the intact 
mucin processed by the lactating gland and yet reacts with molecules 
processed by breast cancer cells, it was appropriate to examine the 
reaction of SU-5 with a range of breast cancers . 

Reactivity of SM-S wi th breast tissues and tumours 

The antibody SU-3 reacted with paraffin embedded tissues provided 
these were fired in methAcarn (not formal saline). Using this method 
for preparation of tissue sections, the reaction of the antibody was 
compared to that of HMFG-2 on breast tissues and tumours with an 
indirect imaunoperoxidaae staining method. This analysis ahowed a 
dramatic difference in the staining pattern of SM-3 compared to that 
seen with HMFG-2. Thus, although a strong positive reaction was seen 
in 20/22 breast cancers stained with SU-3 (as compared to 22/22 
stained with HMFG-2) , normal resting breast, pregnant or lactating 
tissues and most benign lesions were largely unstained with SU-3 but 
were stained with HHFG-2. Some examples of staining patterns of breast 
tissues and tumours are illustrted in Pigure 6. 

Twenty-two primary carcinomas and fourteen benign lesions were 
examined and the reaction of SU3 compared to the ataining with HMFG-2 
in each case. In the primary carcinomas , staining with SU-Z was 
heterogeneous but generally <juite strong and always confined to tumour 
cells; connective tissue and stroma ahowed no reaction (see figures 
6A f B). la the four fibroadenomas examined, ataining of the epithelium 
with HMFG-2 was strong although heterogeneous. In comparison, 
staining with SM-3 was negative in one case and in the three otherB 
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staining was confin d to only on or two" glandular elements. Ifl£PG-2 
showed strong positivity on the five papilloaas and five eases of 
cystic disease studied while the staining observed with SH-3 was very 
much weaker and more heterogeneous (figures 6G,H) . Jfce papillomas as 
- a group showed the strongest staining with SU-3, and it can he seen 
that the staining was membranous or extracellular. 

In contrast to BffG-1 and H£F0-2 which strongly stain lactating 
and pregnant breast, EU-S was totally negative with three out of six 
caBes of pregnant or lactating breast (see figure 6C and D) . Two 
■ positive eases showed only very weak staining of an occasional cell 
and in the third, staining was confined to two areas of one lobule. 
Again, in contrast to B£?G-1 and B£FG-2 which do react with some 
terminal ductal lobular unite of normal, resting breast (albeit 
weakly) , SI-3 was totally negative on eight out of the thirteen eases 
tested and in the other five eases staining was extremely weak and 
often confined to one or two acini in the tissue section (see figure 
6£ and F). It should perhaps be noted that the intensity of staining 
with HJiFG-2 aeen with normal breast, tissues and benign lesions fixed 
in methacarn was somewhat higher than that reported previously using 
formalin fixed material (50,4^. 

SM was also shown to be negative on sections of normal liver, 
lung, thymus, sweat gland, epididymis, prostate, bladder, small 
intestine, large intestine, appendix, thyroid and skin. The antibody 
•bowed weak positive.. staining only with the distal tubules of the 
kidney, the occasional chief cell of the stomaeh. the occasional duct 
cell of the salivary gland and the sebaceous gland. 
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DiBcusBion 

large molecular weight mucin molecules axe expressed by many 
carcinomas and carry &any of the tumour associated antigenic 
determinants recognised by monoclonal antibodies* These epitopes may 
also be expressed by some normal epithelium, and some monoclonal 
antibodies like Hi£FG-1 react particularly veil with a mucin found in 
normal human milk (1,17). As long as the study of the mucins is 
restricted to their detection with antibodies reactive with undefined 
epitopes, the knowledge of their structure, expression and processing 
Till also be restricted. We have begun to investigate the structure 
and expression of the mammary mucin by isolating the core protein and 
developing antibodies which have allowed us to select partial cDNA 
clones for the gene coding for the core protein . This Example 
describes the production and characterization of these antibodies. 

Treatment of the HUTG-1 affinity purified milk mucin with hydrogen 
fluoride resulted in the appearance of a dominant band of about 681 
daltons and a minor species of about 72KD on SDS acrylamide gels. 
These bands showed no reactivity with lectins, including Helix pomatia 
agglutinin which is specific for K-acetyl g alac to & amine , the first 
sugar in Delinked glycosylation (55). It therefore seems probable 
that this 681 dalton polypeptide represents the core protein of the 
mucin. Supportive evidence for this comes from the observation that 
the antibodies described here, which are reactive with the stripped 
681 component, can precipitate a molecule of this sire from the in 
vitro translation products of m£KA isolated from breast cancer cells 
expressing the mucin . 
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As the milk mucin contains at least 50% carbohydrate (16) ,a 
protein core of only 68ED appears too small if the intact molecule .has 
an observed molecular weight greater than 400ID. However, mucins can 
he composed of small subunits which aggregate and axe held together by 
some form of non-covalent interactions, as yet not understood. ?or 
•xaaple, although the molecular weight of the ovine submaxillary mucin 
haa been reported to be greater than 1 x 10 6 daltons (45^X it has a 
protein core of only 650 amino acids with a molecular weight of 5B,300 
daltons it6 ) . 

An unexpected finding was that the antibodies HHFG-1 and HMFG-2 
which react with the. milk mucin, also ahow a positive reaction with 
the extensively stripped material which showed no lectin binding 
capability. Previous indirect evidence, including the resistance to 
fixation, boiling and reduction, the repetitive nature of their 
epitopes and the appearance of several bands on immunoblots, had led 
to the belief that carbohydrate present on the milk mucin was involved 
in these epitopes. This idea wsb reinforced by the observation that 
lectins could block the binding of HMFG-1 and 2 (1 ) . Ihile it is not 
possible to exclude the possibility that some sugars, undetected by 
the lectin binding experiments, remain on the extensively stripped 
mucin described here , this is unlikely to be the explanation for the 
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reactivity of the antibodies HV£?G-1 and 2. Shis can be aaid aince 
both antibodies have recently been ahown to react positively with 
P-galactosidaee fusion proteinB expressed by phage carrying DNA coding 
for the core protein of the mammary mucin . It appears therefore that 
at least part of each of the epitopes recognised by H£FG-1 and 
contain amino acids but it must be assumed that some of these epitopes 
on the core protein are exposed, i.e. not seared in the fully 
glycosylated molecule. The MffG-2 epitope is however less abundant on 
the milk mucin than the 2fl£FG-1 epitope, while it is readily detectable 
on the mucin molecules expressed by tumours (1). These molecules have 
a smaller molecular weight and may be less heavily glycosylated or 
polymerised. 

Here we have reported the development of new antibodies which are 
reactive with the protein core of the mucin and with the partially 
deglycosylated molecule, but which are unreactive with the fully 
processed mucin produced by the lactating mammary gland. One of these 
antibodies SU-3, which is an IgG1, has been atudied in more detail. 
It has been shown to react with the mucin molecules which are produced 
by breast cancer cells and are recognised by many antibodiea developed 
against the intact milk mucin. It should be emphasised however that 
the epitope recognised by SM-3 which is on the core protein and is 
exposed in the mucin as processed by tumour eella, is not exposed on 
the normally processed milk mucin. Thin feature cfterB the 
possibility of enhanced tumour specificity, and a pilot 
immunohistochemical study of breast tumours and tissues has ahown that 
indeed the SH-3 antibody reacts strongly with the majority of primary 
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breast cancers (91$) but shows little or no reaction with benign 
breast tumours, resting or lactating breast, and most normal tissues. 

There are several implications of the work described here which 
may be important for both basic and clinical rtudies in breast cancer. 
The observation that parts of the core protein (detectable by 
antibodies) are exposed on the mucins aa processed by breast cancer 
but masked on the mucin as processed by cells in normal breast and 
benign tumours implies that there is an alteration in the processing 
of the mucin in malignancy. A more detailed study of the processing 
of the mucin in normal and malignant cells may then give basic 
information for defining the malignant cell. Moreover, since the 
specificity of the reaction of the antibody SH-3 for tumours is better 
than that of antibodies developed against the intact mucin, this 
antibody may prove to be a more effective diagnostic tool for the 
detection, of breast cancer cells in tissue sections, tissue fluids and 
cells. The reactive components are membrane associated as well as 
intracellular and in vivo localisation of tumours may also be 
possible. 
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Abbreviations 

She abbreTiations used are: HHFG, human milk fat globule; PBS, 
phosphate-buffered saline (153 ali HaCl, 5 wM ZCL, 10 aM Ka.HP0 , 2 mM 

Di 2 P0 4 pH WGA ' * hoat * erm ««l«tlaiai PHA. peanut agglutinin; 

HPA, Helix pomatia agglutinin; BSA, borine serum albuain; SDS, aodium 
dodecyl sulfate. 
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Example 2 • 

Purification and deglycosylcrfci6n..of hu-tian milk mucin was 
conducted as in Example 1 mucin was purified on an EMFG-1 
antibody. 

The stripped mucin preparations were separated by electrophoresis 
through HaDodS0 4 /polyacrylamide eels (10*) and silver stained by two 
methods, one of which can be used to stain highly glycosylated 
proteins (22,23). 

Preparation of polyclonal rabbit antiserum to stripped core protein 

One New Zealand White rabbit was immunized with 100 jig of the 
partially stripped core protein in complete Preund'e adjuvant (Gibco). 
Booster injections of 500 fig of the totally stripped core protein were 
administered in incomplete Preund's adjuvant (Gibco) 3 and 4 weeks 
after the initial injection and the rabbit was bled one week later. 
Ten microliters of immune serum (75 jig/ml protein) precipitated 200 ng 
of fully stripped core protein in a Protein A assay (24) and detected 
it on immunoblots. The immunoglobulin fractions of rabbit preimmune 
and rabbit anti-mucin core protein were prepared by adding amm onium 
sulfate to 50% saturation. The resulting pellet was resuspended in 
one-half the original serum volume of PBS and dialyxed against the 
same buffer* ifter dialysis, only residual precipitate was removed by 
centrifugation. Immunoglobulin fractions were stored in aliquots at 
-20 *C. 
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Description of MAbs used 



In addition to the polyclonal antiserum used for initial screening, 



mucin core protein (20) and HHFG-1 and HM7G-2 (1,14) were u«d to 
screen the purified plaques, the p-galactosidase fusion proteins and 
for iaaunoprecipitations from in vitro translated proteins*. Other 
KAbi used vere a monoclonal anti-0-g alaetosidase antibody (25) which 
was a gift froa H. Durbin (ICKF, London), an snti-interferon antibody, 
ST254 (24), I£61, a keratin antibody (26) and M1 8 which recognises a 
carbohydrate structure on the milk mucin (27). 

In Vitro translation of proteins 

RNA was isolated froa the human breast cancer cell line MCF-7 
using the guanidium isothiocyanate method of Chirgwin at al. (28) and 
poly(A)* RNA was purified by chromatography using oligo (dT)-cellulose 
(New England Bio Labs). The poly(A) + RNA was translated in a 
reticulocyte lysate system (Amersham) in the presence of [ 55 S] 
methionine (1000 Ci/mmole; 1 Ci = 57 GBq, Amersham). Samples 
containing 5x10* acid insoluble cpm were precipitated in a protein A 
assay (24) using ICAbs SM-3, SI-4, HMPG-1 , HUFG-2 and a control 
antibody to human interferon. Sie antibody-selected proteins were 
then separated on t 10) BaDodSO^/polyacrylamide gel, impregnated with 
Amplify (Amersham) and exposed to XAR-5 film (Kodak) at -70 *C. 

*The HAbs S(-3 and SM-4 (SM refers to stripped smcin) show strong 
reactivity with the partially and fully stripped core protein but no 
reactivity with the fully glycosylated mucin (20). 



a cocktail of two MAbs, SM- 



l-3 and SH-4 (see Example 1) which recognise 
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Antibody screening of Xgtn library and protein blotting 

The Xgt11 library used in this study was constructed from mRNA 
isolated from the human breast cancer cell line UCF-7 ar ** was 
generously provided by Philippe Walter and Pierre Chambon (Strasbourg, 
France) . The poly (A) + EHA used for the preparation of the randomly 
primed library was prepared from &KNA that sedimented faster than 28S 
rRNA and was enriched in estrogen receptor (29). The library was made 
essentially as described by fiuynh et al. and Xoung and Davis (50-32) 
and contained approximately 1 x 10 6 recombinants per jig of RNA. 
Between 85% and 95% of the plaques contained inserts. 

The phage library was plated onto bacterial Btrain XI 090 and grown 
for 3 hr at 42 # C. After isopropyl 0-D-thiogalactoside (IPTG) 
induction and 3 hr of growth at 37 # C, filters were prepared from each 
plate and screened with anti-mucin core protein antibody by the method 
of Xoung and Davis (32) . The first; antibody used in screening was the 
rabbit antiserum raised against the stripped core protein prepared as 
described above. Prior to use in screening, the antiserum was diluted 
1:200 in PBS containing *\% bovine serum albumin (PBS/BSA) . 
Preabsorption with XI 090 bacterial lysate was not found to be 
necessary. The nitrocellulose filters (Schleicher and Schuell) were 
blocked by incubation in PBS containing 5% BSA for 1 hr at room 
temperature with gentle agitation. The filters were incubated at room 
temperature overnight with a 1 :200 dilution of antiserum in heat 
sealed plastic bags, fixe' filters were washed 5x5 sin in PBS/BSA, 
and bound antibody was detected by using horseradish 
peroxidaBe-conjugated sheep anti-rabbit antiserum (Daxo) diluted 1:500 
with PBS/BSA and incubated f r 2 hr with the filters. Th filters 
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were washed 5x5 min in PBS/BSA and 1x10 min in PBS before color 

detection using 4-chloro-1 -napbthol (1). Inmuno re active bacteriophage 

were picked and purified through two additional rounds of screening. 

Subsequently, bacteriophage inserts were subcloned into the EcoRI 

sites of pUC8 (53) producing the plasmid used most extensively, pMUC 

10* The plasmids were maintained in DH1 cells. 

To examine the p-galactoBidase-cDNA fusion proteins for 

immunore activity, cell lysates were derived. I^sogens were prepared 

as described in Young and Davis (34). Cells were pelleted, suspended 

in Laemali sample buffer (35) and separated by electrophoresis through 

NaDodSO /polyacrylamide gels (10%) and transferred onto nitrocellulose 
4 

filters as described (1,36). The filters were treated as above for 
antibody screening. 

Northern Analysis 

RNA was isolated from tissue culture cells and frozen tissues by 
the guanidinium isothiocyanate method of Chirgwin et al . (28). Total 
HJJA (10 yg per lane) was denatured by heating at 55 # C for 1 hr in 
deionized glyoxal and fractionated by electrophoresis through a 1.3% 
glyoxal gel (38). The RNA was transferred to nitrocellulose 
(Schleicher and Schuell) , prehybridixed and hybridized as described by 
Thomas (54). Filters were washed down to 0.11 SSC with 0.1$ BDS at 
65"C and exposed to XAR-5 film (Xodak) at -70 *C with intensifying 
screens • 
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Southern analysis 

High molecular weight genomic DNA was prepared from white blood 
cells and cell lineB (39,40). These genomic CKAs (10j:g) were cleaved 
with restriction enzymes following the manufacturer 1 s recommended 
conditions and fractioned through 0.6% and 0-7% agaroee gels*. Cloned 
plasmid DNA was cleaved and fractionated on.l.SJ agarose. The gels 
were denatured, neutralized and transferred to nylon membranes 
(Biodyne) according to the manufacturer's instructions. The EcoR1 
insert from pMUC1 0 was separated on a 1g low melting point agarose 
(Biorad) gel. and labelled with [a- 52 P]dCTP by the method of random 
priming (41) and hybridized to* filters at 42 "C. Filters were washed 
down to 0.1Z SSC with 0.1% SDS at 55*C and exposed to ZAR-5 film 
(Xodak) at -70 *C with intensifying screens. 

Results . 

Purification and deglycosylation of mucin glycoprotein 

Mucin glycoprotein reactive with the monoclonal antibody HMFG-1 
vaa prepared from pooled human breast milk by using an HUFG- 1 antibody 
affinity column, followed by molecular sieve chromatography on 
Sephadex G-75 in order to remove lower molecular weight components 
(Figure 7. lane 1). In order to demonstrate the homogeneity of the 
purified molecule, amino acid analyses of four separate preparations 
were performed and revealed a fairly consistent composition with 
serine, threonine, proline, alanine and glycine accounting for 58$ of 
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the anino acids 



Periodic acid silver stained gels 



revealed a diffuse band of greater than 400,000 daltons visible only 
when the gel was treated with periodic acid before the silver stain 



visualised on the gel using the silver stain without prior treatment 
with periodic acid. 

The purified material was subjected to treatment with hydrogen 
fluoride to remove the 0-linked sugars that are characteristic of 
mucin glycoproteins. Two different reaction conditions were used 
which resulted in a partially deglycosylated core protein (treated at 
0 # C for 1 hr) and a fully deglycosylated core protein (treated at room 
temperature for 3 hr) as determined by iodinated lectin binding 
following separation by gel electrophoresis and transfer to 
nitrocellulose paper (20). The partially deglycosylated core protein 
was reactive with wheat germ agglutinin, peanut agglutinin and helix 
pommatia lectin (which recognizes the linkage sugar 
K-acetylgalactoeamine) whereas the fully Btripped protein showed no 
reactivity with any of these three lectins. 

The hydrogen fluoride treated core protein was separated by 
electrophoresis through HaDodS0 4 /polyacrylamide gels (10J&) and silver 
stained. Silver staining revealed that the predominant component of 
the partially stripped mucin was a high molecular weight band of about 
400 kd, although faint bands of lower molecular weight could also be 
observed (Pig. 8 . lane 1). Since the high molecular weight material 
showed a somewhat increased mobility in the gel and reacted with the 
lectin recognising the linkage sugar v it can be assumed that some 
sugars had been removed. The fully stripped mucin consisted of two 
bands of about 68 led and 72 kd (Fig. lan 2). 



(Fig. 7, lane 2). Ho other lower molecular weight bands were 
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Antibody reactive -proteins "produced by MCP-7 cells . 

The MCP-7 breast cancer cell line expresses large amounts of 
HMFG-1 and -2 reactive material on its cell surface (14) and was thus 
judged to be a suitable source of mKKA for a cDNA library. Before 
proceeding to screen the MCF-7 library with the monoclonal antibodies , 
they were tested for their ability to precipitate a component from in 
vitro translation products produced from MCF-7 mRMA. Poly (A) + RHA 
from MCF-7 was prepared and translated in vitro . Proteins from the 
translation reaction were immunoprecipitated using the monoclonal 
antibodies HMFG-1, HMFG-2, SM-3 and SM-4 and displayed by 
polyacrylamide gel electrophoresis and fluorography (Pig. 9). Two 
proteins of about 68 kd and 92 kd were immunoprecipitated by SM-3 
. (lane 2} and SM-4 (lane 1). It was also found that HMFG-1 (lane 4) 
and HMFG-2 (lane 3) immunoprecipitated these proteins; however, no 
bands in these areas were precipitated by an irrelevant monoclonal 
antibody to human Interferon (lane 5). The fact that HMFG-1 and -2 
immunoprecipitated these -proteins was an unexpected finding as it was 
previously thought that these HAbs recognize carbohydrate 
determinants (1). However, we also found that HMFG-1 and -2 react 
very strongly with the fully stripped, iodinated core protein (20). 
These results together with the MAb reactions on the P-galaetosidase 
fusion proteins (see below) confirm that the epitopes for HMFG-1 and 
-2 are, at least in part, protein in nature. 

The abundance of the core protein mRHA in total cellular poly (A)* 
RNA was 4% as estimated by comparing the amount of (' 5 S)methionine 
present as immunopr cipitated protein to th amount f methionin 



SUBSTITUTE SHEET 



WO 88/05054 



-43 ■ 



PCT/GB88/00011 



incorporated into total protein during in vitro translation. 

Screening of the cDNA library 

The Xgt11 cDNA library made from size selected MCF-7 mRNA (see 
Methods) was screened initially with the polyclonal antiserum made to 
the mucin core protein which had been stripped of its carbohydrate. 
Screening of 2 x 10 plaques resulted in 11 positive clones, 7 of 
which were taken successfully through two further rounds of plaque 
purification. 

To demonstrate that the reactivity of the phage clones with the 
antibody probes was due to antigenic determinants on the cDNA 
translation product, fl-galactosidase fusion proteins were made from 
all 7 clones. The proteins were separated by electrophoresis, 
transferred to nitrocellulose paper and probed with a variety of 
antibodies to the stripped mucin, including the polyclonal antiserum 
which was used initially to select the clones and a cocktail of SM-3 
and SM-4. In addition, HHFG-1 and HMFG-2, the two monoclonal 
antibodies which originally detected this differentiation and 
tumour-associated epithelial mucin (1,14) were tested. All 7 clones 
yielded fusion proteins which were specifically recognized by the 
polyclonal antiserum, the monoclonal cocktail, and HUFG-2. HMFG-1 
antibody reacted with 6 of the 7 fusion proteins and failed to 
recognize the protein from clone 9 which contains the smallest insert. 
In every case the strongest signal was given by the HMFG-2 antibody 
and this reaction is shown in Figure io. Monoclonal antibodies to 
keratins and to a carbohydrate epitope on this fully glycosylated 
mucin were used as controls and showed no reactivity 
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A monoclonal antibody to 0-galactosidase was a positive control and 
the band recognized correlated in every case with the band recognized 
by the specific antibodies. The sizes of the fusion proteins varied 
in proportion to the sizes of the cDNA inserts found in the 
bacteriophage . 

Characterization of cDNAs and RNA blot analysis 

The inserts from the X clones were designated pHUC5-1 0 (omitting 
pMUC5) and were subcloned into the vector pUC 8 for easier 
manipulation. The 7 clones were compared to each other for sequence 
homology. Each of the plasmids was digested with EcoRI and the insert 
separated on a 1.4% agarose gel. The largest cDNA insert from pMUCIO 
was used to probe the inserts and found to hybridize to all 6 inserts 

(Pig. 11). pMUC 7 was found to contain two inserts following digestion 
with EcoRI; however, only 1 of the inserts hybridized to the pMUClO 
probe. The insert bands were not derived from phage DNA since the 
pMUCIO probe did not hybridize to Hind Ill-digested X phage DNA 

As shown by agarose gel electrophoresis (Pig .XI) , the 
inserts vary in size from about 200 to up to about 1800 bp. The 
largest insert from pMUC10 has been used as the hybridization probe in 
all subsequent experiments . 

Because the JLUUC clones were identified only by antibody binding, 
we needed additional assurance that they were indeed coding for the 
breast epithelial mucin. To determine the authenticity of pl£UC10, we 
correlated the presence of mRNA hybridizing to the clone with mucin 
expression in various cell lines. As shown in figure the cDNA 
hybridiz d to two transcripts of 4.7 fcb and 6.4 kb in th RNA from the 
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breast cancer cell lines MCP-7 and T47D which were shown previously to 
express the HHFG-2 antigen (1,14). Significantly, the pMUC1 0 probe- 
hybridized to transcripts of approximately the same size in RNA 
extracted from normal mammary epithelial cells cultured from milt 
(42). A third band of 5.7kb can be seen in the RHA from these normal 
cells. In contrast, three human cell types that lack the mucin, 
breast fibroblasts, Daudi cells and HS578T, a carcinosarcoma line 
derived from breast tissue (43), showed no detectable pVTDCI 0 -related 
mRNA. The 6.4 kb band appears to be the most adundantly expressed. 
The presence of at least two sizes of mKNA from MCF-7 cells correlates 
with the immunoprecipitation of two proteins of (molecular weights 68 
kd and 92 kd) from in vitro translated mKNA from MCF-7 cells. The 
normal mammary epithelial cells were derived from pooled milk samples 
and the additional transcript observed may be due to polymorphisms 
among individuals. 

Genomic DNA blot hybridization and detection of a restriction fragment 
length polymorphism (RFLP) 

Genomic DNA was prepared from a panel of ten individuals 
consisting of six unrelated individuals and a family of four, and from 
three cell lines. The DNAs which were digested with Hinfl or EcoRI 
and blotted and hybridized to the radiolabeled pUUCIO insert, exhibit 
restriction fragment length polymorphisms. Die restriction fragments 
from the ten individuals and three cell lines are shown in figure 13. 
The pattern consists of either a single band or a doublet of sizes 
ranging from MOObp to 6200bp in the Hinfl digest (with the exception 
of the ZR75-1 DNA in lane 12, f igure 13A which shows three bands) or 
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from 8200bp to 96Q0bp in the EcoRI digest (Figure 13 B) . There appears 
to be a continuous distribution of the fragment sizes which implies a 
bigh in vivo instability at the locus. The pattern of fragments 
observed in the family of four (lanes 1-4) suggests that these 
fragments are allelic. Preliminary studies of the DNA made from white . 
blood cells of normal, related individuals indicate the existence of a 
number of independent alleles with an autosomal codominant mode of 
inheritance These 
studies will be the subject of a separate investigation. 

Discussion 

The cDNA clones described here which were obtained from the l£CF-7 
lgt11 library were selected using polyclonal and monoclonal antibodies 
prepared against a normal cellular product, the. milk mucin in its 
deglycosylated form. This was done because it was easier to obtain 
large quantities of the mucin for stripping than to prepare similar 
quantities of immunologically related glycoproteins expressed by 
breast cancer cells (44). The fact that the antibodies did Belect for 
cDKA coding for nonglycosylated core protein molecules in liCF-7 cells, 
strongly suggests that the glycoproteins in these cells, which were 
originally detected by their reaction with antibodies to the milk 
mucin, contain the same core protein as this mucin. This is confirmed 
by the detection of mRKAs of approximately , the same sizes in the 
normal and malignant cells, using one of the probes isolated from the 
MCF-7 library. We will therefore refer to the antibody reactive 
glycoproteins on breast can er c lis as mucins, bearing in mind that 
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their processing may be different resulting in molecules of different : 
molecular weights but with the same core protein as that of the milk 
mucin. 

Seven clones were obtained from the MC7-7 library of which the 
largest was 1800kb. This clone cross hybridized with the other 6 
smaller clones. Bie p-galactosidaae fusion proteins expressed by six 
of the cross-hybridizing lambda clones were reactive with the 
polyclonal antiserum directed against the mucin core protein as well 
as with four well-characterized monoclonal antibodies directed to 
various epitopes on the stripped core protein, SM-3, SW-4, HMFG-1 and 
HMFG-2 (14,20). The smallest lambda clone, XMUC9, produced a 
p-galactosidase fusion protein which reacted with three of the four 
monoclonal antibodies and with the polyclonal antiserum. 

The surprising result that the extensively characterized 
and HMFG-2 monoclonal antibodies reacted strongly with the lambda 
plagues and the fusion proteins and could immunoprecipitate proteins 
from in vitro translated mRNA provides strong evidence that these 
clones do indeed code for a portion of the mucin core protein, 
Although previous evidence such as resistance to fixation, boiling, 
treatment with dithiothreitol and NaDodSO^ and the presence of 
multiple epitopes on the molecule suggested that these were 
carbohydrate (1 ) , it has now been established that the epitopes of the 
HUFG-1 and KUFG-2 monoclonal antibodies are definitely protein in 
nature. Carbohydrate may be required to obtain the. strongest binding, 
either as part of the epitope or by conferring some conformational 
change on the protein portion, but part of the antigenic determinant 
must consist of an amino acid sequence. Sin e these two HAbe are 
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reactive with the. fully glycosylated milk mucin as well as the 
stripped core protein, this data means that the intact molecule 
contains areas of naked peptide which contribute to the antigenic 
sites for these two antibodies. 

Confirmatory evidence that pMDC1 0 codes for the mammary mucin core 
protein is provided by RNA blots. The relative abundance of mRNA in 
the breast cancer cell lines MCF-7, T47D r ZR-75-1 and in normal 
mammary epithelial cells corresponds to the antigen expression by 
these cells as measured by the binding of the HMFG-1 and HKPG-2 
monoclonal antibodies. Cell, types which are negative for antigen 
expression such as human fibroblasts, Daudi cells and HS578T, a 
carcinosarcoma line derived from breast (14), are negative in RNA blot 
hybridizations. A fortuituous observation made with the ZR-75-1 
cells yielded indirect strong evidence that pHUCIO does indeed code 
for the mucin glycoprotein core protein. This cell line, which 
routinely expresses large amounts both of mRNA and antigen, yielded 
one preparation of RNA which was unexpectedly negative by blot 
hybridization. It was subsequently found that those particular 
ZR-75-1 cells from which the RNA had been made had lost the expression 
of the antigen as well at this time (as determined by reaction with 
HMFG-1 and 2). Different passage numbers of the ZR-75-1 cells were 
recovered and shown once again to express both antigen and message. 
The sizes of the messages, 4.7 kn and 6.4 kb, are quite large, since a 
68 kd or 92 Xd protein would need only about 5 kb to code for the 
protein portions. This suggests that a large portion of the mRNA 
aaybe untranslated. Efforts are underway to obtain a full-length 
clone. 
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Thus, the cDNA clones presented here represent a portion f the 
gene coding for the human mammary mucin which is expressed by 
differentiated breast tissue as well as by most breast cancers* The 
major proteins precipitated from in vitro translation products of RNA 
from VC?-7 cells by antibodies to the milk mucin core protein (68Xd) 
have an apparent molecular weight of 68Kd and 92£d. These proteins, 
produced by the breast cancer cell therefore ahare epitopes with the 
68Xd core protein of the milk mucin (20). Whether a similar 92Ed 
protein is also produced by normal mammary epithelial cells, and is 
truncated or destroyed by HP treatment is not yet clear. MCF-7 cells 
biosynthetically labelled with 14C amino acids yield upon 
immuno precipitation with HMFG-1 and HUPG-2 antibodies, two 
glycosylated proteins of S20 led and 430 ltd, and it 

is possible that each of these glycoproteins utilizes only one core 
protein of either 66£d or 92Xd. Alternatively, each of the 
glycoproteins could contain both the 92Zd and 682Ld proteins either in 
different proportions or variably glycosylated. Further screening of 
the library may yield full length cDNAs coding for both sizes of the 
immunologically related core proteins. Since there appears to be only 
a single gene (based on Southern blot data obtained by using a partial 
cDNA probe) , it is probable that the multiple messages arise by 
alternative RNA splicing and this would explain the fact that they 
contain common sequences. Although a core protein of 68 fed appears to 
be small to yield a fully glycosylated molecule of greater than S00 led 
which contains 50$ carbohydrate* there is evidence that such a 
structure for mucins is possible. Ovine submaxillary mucin has a 
reported molecular weight of 1 i 10 6 daltons (45) , yet its protein 
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core consists of 650 amino acids resulting in a molecule of 58 kd 
(«)• 

The mucins which are detected with H£PG-1 and HHFG-2 MAbs on 
immunoblots of tumours and breast cancer cell lines ahow variations in 
■iza from 80 kd to 400 kd in the molecular weights of the tumour mucin 
molecules (1.47). Using these same antibodies which detect high 
molecular weight mucins preaent in normal urine, a polymorphism ha s 
indeed been shown to be genetically determined (48). Although the 
very low molecular weight components are likely to represent precursor 
forms of the mucin which appears to be incompletely processed in many 
tumour cells (20), the variations in the higher molecular weight 
components are likely to be due to this genetic polymorphism. It was 
unclear, however, whether the structural basia of the polymorphiam" was 
due to the genetically determined protein or to the carbohydrate 
portion of the mucin. The detection of restriction fragment length 
polymorphiams in the Southern blotting experiments using the mucin 
probe augge8t that the mucin polymorphism occurs at the level of the 
DNA which codes for the protein. Preliminary sequence data 
yfeuggest that the basis for this polymorphism is a region 
of variable tandem repeats preaent in the protein coding sequencea. 
ThiB structural feature may be responsible for the generation of the 
many allelic restriction fragments at the mucin locus. We are 
presently investigating the basia of the mucin polymorphism by a 
Southern blot survey of DHA from white blood cells of normal, related 
individuals whose inheritance pattern of urinary mucins has been 
determined. In addition, we are examining DNA preparations made from 
th white blood cells and tumours, f individual br ast can er patients 
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to determine if there 1b any discordance between genotype in the 
paired samples, since tandemly repeated DKA may provide an unstable 
site where recombination or amplification could occur. 

The presence of nrucinB in the majority of carcinomas and their 
association with the differentiation of mammary epithelial cells makes 
it particularly important to identify regions involved in the tissue 
specific and developmental regulation of the gene. Moreover, the 
introduction of a functional mucin gene into cells should provide 
insights into the role of this molecule in breast epithelial 
differentiation and possibly enable us to identify any alterations in 
the function or expression of the mucin which are related to malignant 
transformation in the human breast. 
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Abbreviation 

The abbreviations are as follows: PBS, phosphate-buffered saline; 
MAb. monoclonal antibody; IPTG, isopropyl 0-D-thiogalactoside; bp, 
base pair (s); Xb, kilobase(s). 
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TABLE 1 

Anino acid coaposltion of the human milk aucln - cogparieon with PAS-0 

Amino acid HHFG-1 purified Zzteneiyely atripped PAS-0 

milk mucin nilk aucin (Shimiru & lanauchi 

19B2) 



AjBV 


6.1 


•7 9 
I • C 


6.4 


Thr 


9.4 


V • f 


ft e 


£>er 


9.1 


1S.0 


13.1 


Glx 


6.3 


9.6 


8.3 


Pro 


14.8 


14.4 


12.0 


Gly 


8.1 


10.1 


12.2 


Ala 


12. S 


11.9 


13.0 


Cys 


Mot analysed 


Not analysed 


0.5 


Val 


6.0 


6.S 


5.3 


Met 


0.5 


0.4 


0.B 


lie 


1.6 


1.7 


1.9 


Leu 


4.5 


4.8 


3.7 


Tjr 


2.0 


0.9 


1.6 


Pbe 


2.0 


1.6 


1.7 


Hia 


S.2 


2.S 


3.8 




2.8 


3.3 


2.2 


AT£ 


4.0 


4.0 


3.9 
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Table 2 

Reactivity of the antibodies on intact, r artiallv „nH totally 
deglvcogvl ated milk mucin 



125 



I cpa bound 



Antibody 



Intact molecule Partially Totally 

•tripped mucin stripped mucin 



5.17 


8,524 


9.13 


525 


SM-3 


465 


SM-4 


816 


HMPG-1 


32,000 


HMPG-2 


29,500 


NS2 medium 


- 397 



11,925 
5,000 
15,414 
16,750 
33,768 
29,230 
645 



5,780 
5,328 
9,200 
9,561 
9,494 
15,832 
650 



The binding of the antibodies to iodinated intact, partially and 
totally deglycoeylated milk mucin waa aaaayed uaing the protein A 
plate method as described in Materials and Methods. 
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CLAIMS 

1. An antibody or fragment thereof against a 
human mucin core protein which antibody or fragment has 
reduced or substantially no reaction with fully expressed 
human mucin glycoprotein. 

2. Human polymorphic epithelial mucin core 

protein. 

3. A polypeptide comprising 5 or more amino 
acid residues in a sequence corresponding to the sequence 
(I) 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala 
Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

4. . A polypeptide according to claim 3 having 20 
or more amino acid residues in a sequence corresponding to 
the sequence (I) 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala 
Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

(I) 

5.. A polypeptide according to claim 3 or claim 
4 wherein at least one amino acid residue bears a linkage 
sugar substituent. 

6. A polypeptide according to claim 5 wherein 
the linkage sugar bears an oligosaccharide moiety. 
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7. A polypeptide according to claim 5 or claim 
6 wherein amino acid bearing a substituent is a serine or 
threonine and the linkage sugar is N-acetyl galactosamine . 

8. A polypeptide according to any one of claims 
5 to 7 linked to a carrier protein. 

9. An antibody or fragment thereof against a 
polypeptide according to any one of claims 3 to 8 which 
antibody or fragment has reduced or substantially no 
reaction with fully processed human mucin glycoprotein. 

10. An antibody or fragment thereof according to 
claim 1 or claim 9 against a human polymorphic epithelial 
mucin core protein. 

11. An antibody or fragment thereof according to 
claim 10 against human polymorphic epithelial mucin core 
protein as expressed by a human colon, lung, ovary or 
breast carcinoma. 

12. An antibody or fragment thereof according to 
any one of claims 1 and 9 to 11 which has no significant 
reaction with mucin glycoprotein expressed by pregnant or 
lactating human mammary epithelial tissue. 

13. A monoclonal antibody or fragment thereof 
according to any one of claims 1 and 9 to 12. 

14. A hybridoma cell capable of secreting a 
monoclonal antibody according to claim 13. 

15. A hybridoma cell of the cell line designated 
HSM3 (ECACC 87010701) . 

16. A monoclonal antibody secreted by HSM3 
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(ECACC 87010 701) . 

17. A nucleic acid fragment comprising at least 
17 nucleotide bases the fragment being hybridisable with at 
least one of 

a) the DNA . sequence 

5« * 
ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC- 

* 

ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

3' 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC 

b) DNA of sequence 

5" 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
* 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT- 

GTC ACC TCG GCC CCG GAC ACC AGG CCG 
* 

CCG GGC TCC ACC GCC CCC CCA GCC CAC 

c) RNA having a sequence corresponding to the DNA sequence 
of a) and 

d) RNA having a sequence corresponding to the 



GCC- 
3" 
GGT 
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DNA sequence of b) • 

18. A nucleic acid fragment according to claim 
17 comprising a portion of at least 30 nucleotide bases 
capable of hybridising with at least one of sequences (a) 
to (d) . 

19. A DNA fragment according to claim 17 or 18. 

20. A double stranded DNA fragment comprising 
antiparallel paired portions having respectively sequences 
(a) and (b) as defined in claim 17. 

21. An antibody or fragment thereof according to 
any one of claims 1, 9 to 13 and 16 bearing a detectable 
label or a therapeutically or diagnostically effective 
moiety. 

22. An antibody or fragment thereof according to 
any one of claims 1, 9 to 13, 16 and 21 for use in a method 
of therapy or diagnosis practised on the human or animal 
body. 

23. Human polymorphic epithelial mucin core 
protein bearing a detectable label or a therapeutically or 
diagnostically effective moiety. 

24. Human polymorphic epithelial mucin core 
protein according to claim 2 or claim 23 for use in a 
method of therapy or diagnosis practised on the human or 
animal body. 

25. A polypeptide according to any one of claims 
3 to 8 bearing a detectable label or a therapeutically or 
diagnostically effective moiety. 
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26. A polypeptide, according to any one of claims 
3 to 8 and 25 foe use in a method of. therapy or diagnosis 
practised on the human or animal body, 

27. A nucleic acid fragment according to any one 
of claims 17 to 20 bearing a detectable label or a 
therapeutically or diagnostically effective moiety. 

28. A nucleic acid fragment according to any one 
of claims 17 to 20 and 27 for use in a method of therapy or 
diagnosis practised on the human or animal body. 

29. An assay method comprising contacting a 
sample suspected to contain abnormal human mucin 
glycoproteins with an antibody or fragment thereof 
according to any one of claims 1, 9 to 13, 16 and 21. 

30. A diagnostic or therapeutic method practised 
on the human or animal body comprising administering an 
antibody or fragment thereof according to any one of claims 
1, 9 to 13, 16, 21 and 22. 

31. A diagnostic or therapeutic method 
practised on the human or animal body comprising 
administering human polymorphic epithelial mucin core 
protein according to any one of claims 2, 23 or 24. 

32. A diagnostic or therapeutic method 
practised on the human or animal body comprising 
administering a polypeptide according to any one of claims 
3 to 8, 25 and 26. 

33. A diagnostic or therapeutic method 
practised on the human or animal body comprising 
administering a nucleic acid fragment according to any one 
of claims 17 to 20, 27 and 28. 
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